Round 1 - Technical Screening
Format: 90-minute online assessment
DSA Questions
🔹 Array-based Question
🔹 Required efficient manipulation and traversal techniques.
🔹 Focused on optimal time and space complexity.
Matrix-based Question (BFS)
🔹 Problem involving Breadth-First Search (BFS) on a matrix.
🔹 Tested understanding of graph traversal techniques and edge case handling.
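The exact problem is not reproduced here, so the following is only a minimal sketch of the general pattern: BFS over a grid to find the shortest path between two corners. The grid encoding (0 = open, 1 = wall), the 4-directional moves, and the function name shortest_path_length are assumptions for illustration. The edge cases it handles are an empty grid, a blocked start or end cell, and an unreachable target.

```python
from collections import deque


def shortest_path_length(grid):
    """Return the fewest steps from top-left to bottom-right, or -1 if unreachable.

    Assumed encoding: 0 = open cell, 1 = wall; moves are up/down/left/right.
    """
    if not grid or not grid[0]:
        return -1  # edge case: empty grid
    rows, cols = len(grid), len(grid[0])
    if grid[0][0] == 1 or grid[rows - 1][cols - 1] == 1:
        return -1  # edge case: start or target is blocked

    queue = deque([(0, 0, 0)])  # (row, col, steps taken so far)
    visited = {(0, 0)}
    while queue:
        r, c, steps = queue.popleft()
        if (r, c) == (rows - 1, cols - 1):
            return steps
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] == 0 and (nr, nc) not in visited:
                visited.add((nr, nc))  # mark on enqueue to avoid duplicates
                queue.append((nr, nc, steps + 1))
    return -1  # queue exhausted without reaching the target
```

Marking cells as visited when they are enqueued (rather than when dequeued) keeps each cell in the queue at most once, so time and space are both O(rows * cols).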
SQL Question
A complex SQL problem involving:
🔹 Multiple CTEs (Common Table Expressions)
🔹 Sequential transformation of data
🔹 Final output through joins and aggregations
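The actual question is not included, so this is a hypothetical illustration of the shape described: one CTE filters, a second CTE aggregates the first, and the final SELECT joins the result to another table. The tables (customers, orders), their columns, and the function names are assumptions. It runs against an in-memory SQLite database through Python's sqlite3 module.

```python
import sqlite3


def build_orders_db():
    """Create an in-memory database with small, made-up sample data."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE orders (customer_id INTEGER, amount REAL, status TEXT);
        INSERT INTO customers VALUES (1, 'Ana'), (2, 'Ben');
        INSERT INTO orders VALUES
            (1, 10.0, 'completed'),
            (1, 5.0, 'completed'),
            (2, 20.0, 'cancelled'),
            (2, 7.0, 'completed');
        """
    )
    return conn


def customer_totals(conn):
    """Chain two CTEs, then join and sort for the final output."""
    query = """
    WITH completed AS (            -- step 1: keep only completed orders
        SELECT customer_id, amount
        FROM orders
        WHERE status = 'completed'
    ),
    totals AS (                    -- step 2: aggregate the filtered rows
        SELECT customer_id, SUM(amount) AS total, COUNT(*) AS n_orders
        FROM completed
        GROUP BY customer_id
    )
    SELECT c.name, t.total, t.n_orders   -- step 3: join back to names
    FROM totals AS t
    JOIN customers AS c ON c.id = t.customer_id
    ORDER BY t.total DESC, c.name
    """
    return conn.execute(query).fetchall()
```

Each CTE reads only from the one before it, which keeps every transformation step individually testable by selecting from that CTE alone.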
Round 2 - SQL Deep Dive
Introduction
🔹 General background discussion and overview of previous roles.
SQL Scenarios
Rolling Mean per Product
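🔹 Given a product table with dates, implement a rolling mean over 3 consecutive dates.
🔹 Used window functions with UNBOUNDED PRECEDING. Note that a frame starting at UNBOUNDED PRECEDING yields a cumulative mean; a strict 3-row window is ROWS BETWEEN 2 PRECEDING AND CURRENT ROW.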
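A minimal sketch of a 3-row rolling mean per product, using a ROWS BETWEEN 2 PRECEDING AND CURRENT ROW frame. The table name product_sales and its columns are assumptions, as is the premise that each product has exactly one row per date with no gaps (otherwise "3 rows" and "3 consecutive dates" differ). Window functions need SQLite 3.25 or newer, which recent Python builds bundle.

```python
import sqlite3


def build_sales_db():
    """Create an in-memory table with made-up daily amounts per product."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE product_sales (product_id TEXT, sale_date TEXT, amount REAL);
        INSERT INTO product_sales VALUES
            ('A', '2024-01-01', 10),
            ('A', '2024-01-02', 20),
            ('A', '2024-01-03', 30),
            ('A', '2024-01-04', 40),
            ('B', '2024-01-01', 5);
        """
    )
    return conn


def rolling_means(conn):
    """Return (product_id, sale_date, rolling_mean_3) ordered by product and date."""
    query = """
    SELECT product_id,
           sale_date,
           AVG(amount) OVER (
               PARTITION BY product_id          -- restart the window per product
               ORDER BY sale_date               -- ISO dates sort correctly as text
               ROWS BETWEEN 2 PRECEDING AND CURRENT ROW
           ) AS rolling_mean_3
    FROM product_sales
    ORDER BY product_id, sale_date
    """
    return conn.execute(query).fetchall()
```

For the first two rows of each product the frame holds fewer than three rows, so the mean is taken over whatever is available; filtering those rows out is a separate design decision.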
CTE + Join + RANK()
🔹 Two tables provided.
🔹 Required: a CTE, a join, and the RANK() window function.
🔹 Follow-up: Modify the solution to use a LEFT JOIN so that rows without a match are still included.
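The two tables from the interview are not described, so this sketch invents a pair (employees and sales, both hypothetical) to show the combination: a CTE that aggregates, a LEFT JOIN that keeps employees with no sales, and RANK() partitioned by department.

```python
import sqlite3


def build_team_db():
    """Create an in-memory database with made-up employees and sales."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, dept TEXT);
        CREATE TABLE sales (employee_id INTEGER, amount INTEGER);
        INSERT INTO employees VALUES
            (1, 'Ana', 'east'), (2, 'Ben', 'east'),
            (3, 'Cy', 'east'), (4, 'Dee', 'west');
        INSERT INTO sales VALUES (1, 100), (2, 100), (4, 50);
        """
    )
    return conn


def ranked_sales(conn):
    """Rank employees by total sales within each department."""
    query = """
    WITH totals AS (
        SELECT employee_id, SUM(amount) AS total
        FROM sales
        GROUP BY employee_id
    )
    SELECT e.name,
           e.dept,
           COALESCE(t.total, 0) AS total,       -- no sales becomes 0, not NULL
           RANK() OVER (
               PARTITION BY e.dept
               ORDER BY COALESCE(t.total, 0) DESC
           ) AS dept_rank
    FROM employees AS e
    LEFT JOIN totals AS t ON t.employee_id = e.id   -- keeps employees with no sales
    ORDER BY e.dept, dept_rank, e.name
    """
    return conn.execute(query).fetchall()
```

With an inner join, the employee with no sales would vanish from the result. RANK() also leaves a gap after ties (1, 1, 3), which is where it differs from DENSE_RANK().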
SQL Concepts Discussion
Indexing Types:
🔹 Clustered vs Non-clustered
🔹 Bitmap, B-Tree, etc.
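As a rough illustration of what a secondary B-tree index changes, the sketch below asks SQLite for its query plan before and after creating an index. The table events and index idx_events_user are hypothetical. SQLite is only a stand-in here: its rowid tables are stored as a B-tree keyed by rowid (loosely comparable to a clustered index), CREATE INDEX builds a separate B-tree (comparable to a non-clustered index), and it has no bitmap indexes.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]  # detail is the last column


def demo_index_effect():
    """Compare the plan for the same lookup without and with an index."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE events (id INTEGER PRIMARY KEY, user_id INTEGER, payload TEXT)"
    )
    lookup = "SELECT * FROM events WHERE user_id = ?"

    before = query_plan(conn, lookup, (1,))  # expected: a full table scan
    conn.execute("CREATE INDEX idx_events_user ON events (user_id)")
    after = query_plan(conn, lookup, (1,))   # expected: a search via the index

    conn.close()
    return before, after
```

The exact wording of the plan text varies between SQLite versions, so it is safer to look for the index name in the output than to compare whole strings.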
Partitioning:
🔹 Horizontal vs Vertical
🔹 Partition pruning techniques
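Partition pruning can be shown without a database. In this toy sketch, rows are horizontally partitioned by (year, month), and a date-range query first discards every partition whose key cannot overlap the range. The row shape (dicts with a "day" field) and both function names are assumptions for illustration; real engines do the same thing using partition metadata.

```python
from collections import defaultdict
from datetime import date


def partition_by_month(rows):
    """Horizontally partition rows into {(year, month): [rows]}."""
    partitions = defaultdict(list)
    for row in rows:
        partitions[(row["day"].year, row["day"].month)].append(row)
    return dict(partitions)


def query_range(partitions, start, end):
    """Return (matching rows, partitions scanned) for start <= day <= end."""
    lo, hi = (start.year, start.month), (end.year, end.month)
    # Pruning step: only partitions whose key lies inside the range are read.
    scanned = [key for key in sorted(partitions) if lo <= key <= hi]
    matches = [
        row
        for key in scanned
        for row in partitions[key]
        if start <= row["day"] <= end  # exact filter inside surviving partitions
    ]
    return matches, scanned
```

Pruning only helps when the query predicate is on the partition key; a filter on any other column would still have to read every partition.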
File Formats:
🔹 CSV, Parquet, ORC
🔹 When to use which based on performance and use case
Round 3 - Data Governance and Practices
Introduction
🔹 Discussion around prior work experience and roles.
Behavioral & Conceptual Topics
Expedia Value - "Force Simplicity"
Asked for a significant real-life instance where I:
🔹 Reduced complexity
🔹 Created clear, impactful outcomes
User-Centricity
🔹 Building with user empathy
🔹 Gathering feedback and improving data products
Data Governance
🔹 Access control
🔹 Data lineage
🔹 Auditability
🔹 Metadata management
Data Cleaning & BI Practices
🔹 End-to-end cleaning pipelines before BI integration
🔹 Dealing with nulls, duplicates, inconsistent formats
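The three problems named above (nulls, duplicates, inconsistent formats) can be sketched as one small cleaning pass. The record shape (email and signup_date fields), the accepted date formats, and the policy of dropping rows with missing required fields are all assumptions for illustration, not the pipeline discussed in the interview.

```python
from datetime import datetime

# Assumed input formats; extend as new variants show up in the source data.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%Y/%m/%d")


def parse_date(text):
    """Normalize a date string to ISO format, or return None if unparseable."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def clean_records(records):
    """Normalize formats, drop rows with missing fields, then de-duplicate."""
    cleaned, seen = [], set()
    for rec in records:
        email = (rec.get("email") or "").strip().lower()   # None becomes ""
        day = parse_date(rec.get("signup_date") or "")
        if not email or day is None:
            continue  # null or unparseable required field
        key = (email, day)
        if key in seen:
            continue  # duplicate once values are normalized
        seen.add(key)
        cleaned.append({"email": email, "signup_date": day})
    return cleaned
```

Normalizing before de-duplicating matters: " A@x.com " on 2024-01-05 and "a@x.com" on 05/01/2024 only compare equal after both fields are cleaned.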
Challenges & Learnings
Past challenges related to:
🔹 Pipeline maintenance
🔹 Stakeholder alignment
🔹 Data quality